A comparative study of LPC parameter representations and quantisation schemes for wideband speech coding

نویسندگان

  • Stephen So
  • Kuldip K. Paliwal
چکیده

In this paper, we provide a review of LPC parameter quantisation for wideband speech coding as well as evaluate our contributions, namely the switched split vector quantiser (SSVQ) and multi-frame GMM-based block quantiser. We also compare the performance of various quantisation schemes on the two popular LPC parameter representations: line spectral frequencies (LSFs) and immittance spectral pairs (ISPs). Our experimental results indicate that ISPs are superior to LSFs by 1 bit/frame in independent quantiser schemes, such as scalar quantisers; while LSFs are the superior representation for joint vector quantiser schemes. We also derive informal lower bounds, 35 and 36 bits/frame, for the transparent coding of LSFs and ISPs, respectively, via the extrapolation of the operating distortion-rate curve of the unconstrained vector quantiser. Finally, we report and discuss the results of applying the SSVQ with dynamically-weighted distance measure and the multi-frame GMM-based block quantiser, which achieve transparent coding at 42 and 37 bits/frame, respectively, for LSFs. ISPs were found to be inferior to the LSFs by 1 bit/frame. In our comparative study, other quantisation schemes that were investigated include PDF-optimised scalar quantisers, the memoryless Gaussian mixture model-based block quantiser, the split vector quantiser, and the split-multistage vector quantiser with MA predictor from the AMR-WB (ITU-T G.722.2) speech coder. © 2005 Elsevier Inc. All rights reserved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

New innovations in multi-pulse speech coding for bit rates below 8 kb/s

Recently, MP-LPC [1] and other similar analysis-bysynthesis (A-by-S) coding schemes have proved that it is possible to achieve very high quality coded speech down to around 8 kb/s. In this paper we report on simulation results of a multi-pulse coder with pitch prediction at bit rates of 8 kb/s and below. The effects of various long term prediction configurations have been evaluated both subject...

متن کامل

Wideband Speech Coding at 4 kbps using Waveform Interpolation

In this paper we present a new low rate, wideband speech coder operating at 4 kbps and based on Waveform Interpolation (WI). An outline of WI speech coding is provided together with a description of its adaptation to wideband speech. Particular emphasis is placed on the quantisation of the WI parameters. Included is a detailed analysis of the quantisation requirements for the Line Spectral Freq...

متن کامل

Wideband speech coding, speech spectral quantisation Speech Spectral Quantizers for Wideband Speech Coding

In this treatise a range of Line Spectrum Frequency (LSF) Vector Quantization (VQ) schemes were studied comparatively, which were designed for wideband speech codecs. Both predictive arrangements and memoryless schemes were investigated. Specifically, both memoryless Split Vector Quantization (SVQ) and Classified Vector Quantization (CVQ) were studied. These techniques exhibit a low complexity ...

متن کامل

High quality coding of wideband speech at 24 kbit/s

This paper proposes a Wideband-CELP-Coding scheme (bandwidth 7kHz) at 24 kbit/s. The codec introduces a delay of just 10 ms. This fulfills the requirements of a possible codec candidate for wideband speech coding within DECT or video applications [I]. The analysis-by-synthesis structure of the proposed Wideband-CELP-Codec includes an alternative LPC analysis concept, where the autocorrelation f...

متن کامل

Temporal decomposition: a promising approach to low rate wideband speech compression

In this paper, we present new results on Temporal Decomposition (TD) applied to the Line Spectral Frequencies (LSFs) derived for wideband speech. The paper shows that by incorporating a dynamic programming search algorithm into TD, near transparent quantisation of wideband LSFs can be obtained at approximately 1 kbps. We also show that TD performs significantly better than Split Vector Quantisa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Digital Signal Processing

دوره 17  شماره 

صفحات  -

تاریخ انتشار 2007